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ABSTRACT 

It is often claimed that overdensities of (or even individual bright) submillimetre-selected 
galaxies (SMGs) trace the assembly of the most-massive dark matter structures in the Uni¬ 
verse. We test this claim by performing a counts-in-cells analysis of mock SMG catalogues 
derived from the Bolshoi cosmological simulation to investigate how well SMG associations 
trace the underlying dark matter structure. We find that SMGs exhibit a relatively complex 
bias: some regions of high SMG overdensity are underdense in terms of dark matter mass, 
and some regions of high dark matter overdensity contain no SMGs. Because of their rar¬ 
ity, Poisson noise causes scatter in the SMG overdensity at fixed dark matter overdensity. 
Consequently, rich associations of less-luminous, more-abundant galaxies (i.e. Lyman-break 
galaxy analogues) trace the highest dark matter overdensities much better than SMGs. Even 
on average, SMG associations are relatively poor tracers of the most significant dark matter 
overdensities because of ‘downsizing’: at z < 2.5, the most-massive galaxies that reside in 
the highest dark matter overdensities have already had their star formation quenched and are 
thus no longer SMGs. At a given redshift, of the 10 per cent most-massive overdensities, only 
^ 25 per cent contain at least one SMG, and less than a few per cent contain more than one 
SMG. 

Key words: cosmology: theory - cosmology: large-scale structure of Universe - galaxies: 
clusters: general - galaxies: high-redshift - methods: numerical - submillimeter: galaxies. 


1 INTRODUCTION 

Submillimetre-selected galaxies (SMGs; see Casey et al. 2014 for 
a recent review), with typical infrared (IR) luminosities of Lir > 
SxlQi^Lo , represent the rarest and most extreme examples of star 
forming galaxies. The Lir of an SMG implies an immense star for¬ 
mation rate, typically SFR ~ 500 — 1000 Mq yr“^, assuming that 
there is not a significant contribution to Lir from deeply obscured 
AGN. SMGs allow us to probe the mechanisms behind most in¬ 
tense star formation events in the Universe and can elucidate the 
highest SFRs sustainable in a galaxy. Because of their extreme na¬ 
ture, SMGs provide laboratories to test the limits of hydrodynam- 
ical simulations of galaxies (e.g. Narayanan et al. 2010; Hayward 
etal. 2011; Hayward 2013). 

Massive starburst galaxies appear to grow in the most massive 
halos (Hickox et al. 2012), thus making them potential tracers for 
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the highest-redshift proto-clusters (e.g. Capak et al. 2011; Walter 
et al. 2012). Thus, observations of bright SMGs should probe their 
environment and trace significant overdensities that can be inter¬ 
preted in the context of large-scale structure simulations. Therefore, 
SMGs set critical constraints on cosmological models. 

Interest in SMG associations has grown in recent years as in¬ 
creasing numbers of SMG associations have been detected (e.g. 
Blain et al. 2004; Chapman et al. 2005; Chapman et al. 2009; Geach 
et al. 2005; Daddi et al. 2009; Dannerbauer et al. 2014; MacKen- 
zie et al. 2014; Smail et al. 2014). Furthermore, Clements et al. 
(2014) have demonstrated that some Planck sources trace overden¬ 
sities of dusty star-forming galaxies, and they suggested that such 
observations can be used to investigate the epoch of galaxy clus¬ 
ter formation. However, this claim relies on the assumption that 
overdensities of dusty star-forming galaxies correspond to galaxy 
clusters in the process of formation. 

There is some observational evidence that calls this claim into 
question: in their study of the GOODS-N field, Chapman et al. 
(2009) found an association of 8 SMGs at z ~ 1.99. The associ- 
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ated structure was only a typical overdense region, as indicated by 
the well-sampled optical spectroscopy in this region, that would not 
form a virialized cluster hy 2 ; = 0. Moreover, Blain et al. (2004) 
found that the clustering length of SMGs is consistent with that 
of evolved ‘extremely red objects’ (EROs) at 2 ~ 1 and 2 = 0 
clusters, which would suggest that the descendants of SMGs would 
tend to be found in rich cluster environments; however, this inter¬ 
pretation implies a comoving space density of clusters that is at 
least an order of magnitude greater than that observed. These re¬ 
sults suggest that perhaps associations of SMGs trace particularly 
active phases in relatively modest-mass overdensities rather than 
the highest overdensities and thus have a relatively complex clus¬ 
tering bias. 

To test this possibility, we have performed a counts-in-cells 
analysis on the Hayward et al. (2013a, hereafter H13) simulated 
SMG catalogues to investigate the relationships of SMGs and more 
modestly star-forming galaxies to the underlying dark matter struc¬ 
ture. We first investigate the clustering biases of SMGs and Lyman- 
break-galaxy (LBG) analogues. We then study the properties of in¬ 
dividual associations of SMGs and LBG analogues. 


2 METHODS 


To analyze the bias in the SMG population, we use the mock SMG 
catalogues of HI 3, which were generated by assigning galaxy prop¬ 
erties to dark matter haloes from a cosmological collisionless dark 
matter simulation using subhalo abundance matching and then as¬ 
signing submm flux densities using a fitting function derived from 
the results of performing radiative transfer on idealized hydrody- 
namical simulations. We will summarize the H13 methodology 
here, and we refer the reader to H13 for full details. 

Using halo catalogues from the Bolshoi simulation (Klypin, 
Trujillo-Gomez, & Primack 2011; Behroozi et al. 2013b,c), we 
constructed mock lightcones by starting at eight random locations 
within the simulation and selecting haloes along a randomly ori¬ 
ented sightline with an 84’ x 84’ (1.96 deg^) field of view from 
2 = 0.5 to 2 = 8. We calculated cosmological redshifts, includ¬ 
ing the effects of halo peculiar velocities. We then assigned stellar 
masses (M*) and SFRs using the redshift-dependent stellar mass- 
halo mass and SFR-halo mass relations of Behroozi, Wechsler, & 
Conroy (2013a), which include scatter at fixed halo mass and red- 
shift. We included a simple model for satellite quenching; satellite 
SFRs were reduced by a factor equal to their current subhalo mass 
divided by the peak mass in their subhalo’s mass accretion history. 
We assigned dust masses (Md) to the haloes using the empirically 
based method of Hayward et al. (2013b). Finally, we assigned 850- 
/rm flux densities (Sgso) using the following fitting function, which 
was derived based on the results of performing dust radiative trans¬ 
fer on hydrodynamical simulations of idealized disc galaxies and 
mergers (Hayward et al. 2011, 2013b); 


•S '850 = 0.81 mjy 


SFR 


100 Mq yr“ 


f Md 


10» Mq 


( 1 ) 


where we incorporated the scatter in the relation of 0.13 dex (Hay¬ 
ward et al. 2011) when assigning 5 s 5 o- Note that because Sgso 
scales sublinearly with both SFR and Mg, the predicted S '850 val¬ 
ues are relatively insensitive to the model details. Furthermore, the 
Sgso-M* relation predicted in this manner agrees well with that 
observed (Davies et al. 2013). 

Throughout this work, we refer to mock galaxies with Sgoo > 
3 mJy as SMGs (the median SFR for sources with Sg 5 o ~ 3 


^ • SMG 



1 


-2 0 2 4 6 8 10 

^mass 

Figure 1. Top: Overdensity of LBGs (0.1 mJy < Sgso < 1 rnJy) vs. over¬ 
density of dark matter, 5masa, for the random cells. There is a clear correla¬ 
tion between 5gaiaxy and dmass, which indicates that the clustering of the 
LBGs traces the clustering of the dark matter well. Bottom: Similar to the 
top panel, but for SMGs (Sgso > 3 mjy; red circles) and a random subset 
of LBGs selected to have number density equal to that of the SMGs (blue 
diamonds). (The SMG points have been shifted upwards by 5 for clarity.) 
The SMGs and random subset of LBGs exhibit a large scatter in <5gaiaxy 
at a given dniaas- This result indicates that because of the rarity of SMGs, 
Poisson noise causes SMG overdensities to be poor tracers of dark matter 
overdensities. 


mJy is ~ 140 Mq yr“^) and those with 0.1 < Sgso < 1 
mJy as LBGs (this range corresponds to median SFR values of 
~ 10 — 50 Mq yr“^). We study the bias of SMGs and LBGs and 
identify SMG and LBG associations (or redshift spikes; e.g. Chap¬ 
man et al. 2005; Chapman et al. 2009) using a simple counts-in- 
cells analysis (e.g. Adelberger et al. 1998). Specifically, we divide 
each of the 8 mock catalogues into cells with angular dimensions 
10 arcmin x 10 arcmin and depth dz = 0.05; the results are similar 
if we use cells with side lengths equal to twice these values. We use 
a subset of 10,000 of these cells for calculating the clustering bias 
and for making comparisons to the properties of SMG and LBG 
associations; we refer to these cells as ‘random cells’. To identify 
associations, we start with the same cells. However, to ensure that 
we do not divide potential associations by using a fixed grid, we 
shift the cells by 1-arcmin intervals 10 times and define an SMG 
(LBG) association as the galaxies contained in the cell that contains 
the maximum number of SMGs (LBGs). We ensure that we do not 
count a single association multiple times. We calculate total dark 
matter masses for each cell by summing the dark matter masses of 
all haloes of mass > 10^° Mq (because the Bolshoi simulation is 
incomplete below this halo mass) contained in the cell. 

As discussed in detail in H13, the H13 model does not in¬ 
clude the effect of starbursts (i.e. the extended tail to high SFR 
at a given stellar mass and redshift). Because one would expect 
that interaction-induced starbursts would most affect the SFRs of 
SMGs in highly overdense regions and thus potentially alter our re¬ 
sults, we have extended the H13 model by including a model for 
interaction-induced starbursts. For each galaxy, we check whether 
it has a neighboring galaxy with stellar mass within a factor of 3 of 
its own (such that the pair would constitute a ‘major’ merger) that 
is located within a physical distance of dweak- If so, the SFRs of 
both galaxies are boosted by a factor of fiweak. If the separation is 
less than datrong < dweak, we instead boost the SFRs by a larger 
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Figure 2. Spatial distributions of galaxies near the 2 richest SMG associations (which each contain 6 SMGs; the cells are marked with dashed lines). The 
SMGs (LBGs) are denoted with red (blue) points, the sizes of which are proportional to Sgso- The spatial distributions of the galaxies reflect the filamentary 
structure of the dark matter distribution. 


factor, bstrong > feweak- Wc experimented with different reasonable 
parameter values, as judged based on the results of idealized hydro- 
dynamical simulations of mergers (e.g. Cox et al. 2008; Torrey et al. 
2012; Hayward et al. 2014), and found that the results were qual¬ 
itatively unaffected even for the extreme scenario of dweak = 15 
kpc, feweak = 10, dstrong = 5 kpc, and dstrong = 100. However, it 
is important to note that the catalogues are incomplete for mergers 
with small separations (Behroozi et al. 2013b), which would result 
in an underestimate of the number of interacting galaxies. Never¬ 
theless, this incompleteness likely does not affect our results be¬ 
cause although interactions could boost the submm fluxes of some 
galaxies and increase the clustering signal on < 10 kpc scales, the 
clustering on larger scales should be unaffected. In all figures, we 
show the results for the original HI3 model, but the corresponding 
plots for the boosted models are similar. 

Note that unlike H13, we have not incorporated the effects of 
blending of multiple galaxies into a single submm source in this 
work, although theoretical arguments (Hayward et al. 2011, 2012, 
2013b; H13) and observations (e.g. Karim et al. 2013; Hodge et al. 
2013) suggest that blending significantly affects the single-dish- 
detected SMG population. The reason is that the sizes of the associ¬ 
ations are much greater than the beam sizes of single-dish submm 
telescopes (see below). Thus, although the detailed results could 
be affected by blending, our conclusions would be unchanged if 
blending were incorporated. Furthermore, we wish to analyze how 
well individual submm-bright galaxies, which would be resolved 
by e.g. the Atacama Large Millimeter Array, rather than blended 
submm sources (which depend on the beam size of the instrument 
used to detect them and are thus a less general population than re¬ 
solved sources) trace dark matter structures. 


3 RESULTS 


For each cell, we calculate the number overdensity of SMGs 
(5smg) and LBGs (iJlbg) using the following equation: 


Ogalaxy — 


-^galaxy -^galaxy ^ 

^galaxy ^ 


( 2 ) 


where A^gaiaxy is the number of galaxies in a cell and < A^gaiaxy > 
is the mean number of galaxies per cell. We also calculate the dark 
matter mass overdensity of each cell, 


Table 1. Demographics of SMG associations 


(Vsmg“ 

Percentage* 

Mass"^ 
(IQi® Mo) 

Separation'^ 

(Mpc) 

1 

64 

5.9 ±3.0 

- 

2 

23 

7.7 ±3.6 

7.6 

3 

10 

9.1 ±4.7 

6.1 

4 

2 

10.3 ±2.8 

5.2 

5 

0.8 

12.6 ±2.6 

4.3 

6 

0.2 

14.1 ±5.6 

4.6 


" Number of SMGs in a cell. ^ Percentage of SMGs in such as¬ 
sociations. ^ Median mass of such associations at 2 < 2 < 3. 
^ Median paimise separations of SMGs in such associations. 




mass 


ALdm— < Mdm > 
< Mdm > 


(3) 


where Mdm is the mass of dark matter in a cell and < Mdm > is 
the mean dark matter mass per cell. 

Fig. 1 shows the overdensity 5gaiaxy of SMGs (S'sso > 3 mjy; 
red circles), LBGs (0.1 mJy < Sgso < 1 mJy; black diamonds) and 
a random subset of LBGs selected to have number density equal to 
that of the SMGs (blue diamonds) vs. overdensity of dark matter, 
5mass, for the random cells. For the total LBG population, 5gaiaxy 
and Jmass are tightly correlated, which indicates that LBG over¬ 
densities are good tracers of dark matter overdensities. The slope 
of the best-fitting linear relation (i.e. the bias, h = 5gaiaxy/5mass) 
is 0.98 ± 0.01, and the mean squared error (MSB) is 0.1. For the 
SMGs and random subset of LBGs, there is a correlation between 
^galaxy and dmass, but it exhibits significant scatter. The bias values 
are 1.3 ± 0.1 and 1.0 ±0.1 for the SMGs and LBGs, respectively, 
and the MSB values are 17 and 16. The fact that the SMGs and ran¬ 
dom LBGs exhibit similar scatter indicates that Poisson noise due 
to the rarity of SMGs is the reason for the complicated relation¬ 
ship between Jgaiaxy and iJmass for this population. Thus, although 
SMGs are slightly more biased than LBGs, SMG overdensities are 
poor tracers of the underlying dark matter overdensities. This ef¬ 
fect may explain the results of Blain et al. (2004) discussed above. 

We now investigate the properties of individual SMG and 
LBG redshift associations in detail. Table 1 presents the fraction 
of SMGs in associations and the median separation of the SMGs 
in the different types of associations. Typical SMGs are not in as- 
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Figure 3. Left: total dark matter mass in a cell versus redshift of the cell for cells that contain ^ 5 SMGs (red circles), cells with ^ 195 LBGs (this number was 
selected to yield the 10 richest LBG associations; black circles), cells that contain exactly 100 LBGs (green circles) and randomly selected cells (blue points). 
The redshift and dark matter mass distributions are shown next to the respective axes. Compared with the richest LBG associations, the SMG associations 
trace less-massive, higher-redshift structures. Associations of 100 LBGs trace lower-mass dark matter substructures that span the full redshift range considered. 
Right: similar to the left panel, but with cells classified according to the number of SMGs that they contain. Cells with lower numbers of SMGs tend to include 
less dark matter. Notably, some of the most overdense regions contain no SMGs. 


sociations, but a substantial minority (36 per cent) are. Only a few 
per cent of SMGs are located in rich associations of four or more 
SMGs. The richer associations exhibit lower median separations, 
which suggests that these associations correspond to higher over¬ 
densities (which will be confirmed below). Fig. 2 shows the spatial 
distributions of the SMGs and LBGs in two mock SMG associ¬ 
ations. The associations each contain 6 SMGs. The spatial distri¬ 
butions of the galaxies exhibit clear filamentary structures, which 
reflect the underlying structure of the ‘cosmic web’. Note that in¬ 
corporating blending (using a typical beam of < 30 arcsec) would 
tend to increase the submm fluxes of the LBGs and blend the SMGs 
with LBGs, but it would not blend any of the bright SMGs. Thus, a 
blended version would potentially contain additional bright SMGs 
and therefore be comparable to observed SMG associations (e.g. 
Chapman et al. 2009; Dannerbauer et al. 2014). 

Our goal is to understand how well SMG associations trace the 
highest dark matter overdensities. To do so, it is instructive to com¬ 
pare the total dark matter mass in individual cells that are identified 
as SMG or LBG associations with the values for randomly selected 
cells. If SMG associations trace the most significant overdensities, 
these cells should contain more dark matter mass than other cells 
at a given redshift. The left panel of Fig. 3 shows the total dark 
matter in a given cell versus the redshift of the cell for cells that 
contain 5 SMGs (red circles), the 10 richest LBG associations 
(^ 195 LBGs in a cell; black circles), cells that contain exactly 100 
LBGs (green circles), and a subset of randomly selected cells (blue 
points). The redshift and dark matter mass distributions are shown 
next to the respective axes. 

It is immediately clear that the SMG associations do not trace 
the most significant dark matter overdensities, although they do 
trace relatively high overdensities. Compared with the richest LBG 
associations, which should be considered analogous to observed 
associations (i.e. redshift spikes) of LBGs, the SMG associations 
tend to have lower dark matter masses (a median of 1.2 x lO^'^ Mq 
for the SMG associations compared with 2.2 x 10^'* M© for the 
195—LBG associations) and are located at higher redshifts (the 
median values for the SMG and LBG associations are 2.4 and 2.0, 
respectively). Furthermore, there are many randomly selected cells 
that have dark matter masses that are comparable to or even greater 


than the values for the SMG associations, whereas the richest LBG 
associations more faithfully trace the cells with the largest dark 
matter masses. 

For comparison, we show more-modest LBG associations that 
contain exactly 100 LBGs (green circles). As expected, these LBG 
associations trace less massive substructures than the richest LBG 
associations. The median dark matter mass of the 100-LBG associ¬ 
ations is similar to that of the SMG associations, 9.1 x lO'^® Mq, 
but the 100-LBG associations span a broader range of redshifts. 

The right panel of Fig. 3 shows the total dark matter mass 
in a cell versus redshift of the cell for cells that contain one or 
more SMGs (coloured according the number of SMGs). This figure 
demonstrates multiple interesting results: first, many of the highest 
dark matter overdensities at a given redshift contain no SMGs (the 
blue points with dark matter mass > 2 x 10^’’ M©). At a given red¬ 
shift, of the 10 per cent most-massive overdensities, only ~ 25 per 
cent contain at least one SMG, and less than a few per cent contain 
more than one SMG. Consequently, finding dark matter overdensi¬ 
ties using SMGs as signposts will cause one to miss many of the 
highest overdensities. Second, cells with lower numbers of SMGs 
tend to have less dark matter. Finally, the minimum mass necessary 
for a cell to host an SMG is ~ Mq, which is consistent with 
the results of inferences from the clustering of real SMGs (Hickox 
et al. 2012). 


4 SUMMARY AND DISCUSSION 

We have used mock SMG catalogues to demonstrate that SMG as¬ 
sociations are poor tracers of the highest overdensities of dark mat¬ 
ter. At higher redshifts {z > 2.5), the richest SMG associations 
trace some of the highest overdensities because the most-massive 
galaxies in those regions are still forming stars rapidly. However, 
such associations are rare, and the majority of the highest over¬ 
densities do not host a single SMG, let alone an SMG association. 
Consequently, SMG associations are highly incomplete tracers of 
the highest overdensities even at z > 2.5. The situation is worse 
at z < 2.5: many of the most-massive galaxies, which reside in 
the highest dark matter overdensities, have already had their star 
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formation quenched. (Independently of redshift, the halos with the 
highest ratio of SFR to halo mass are those with halo masses of 
~ 10^^ Mq at that redshift; e.g. Behroozi et al. 2013a; Moster 
et al. 2013; Sparre et al. 2015.) Consequently, the z < 2.5 dark 
matter overdensities are less likely to contain SMGs. 

In our model, galaxy SFRs are assigned using a redshift- 
dependent SFR-halo mass relation and a model for satellite 
quenching. The parameters of the model are constrained by fit¬ 
ting to a wide range of observations (Behroozi et al. 2013a). 
Consequently, the fact that some fraction of massive galaxies are 
quenched even at 2 ~ 2 is not a prediction. The utility of our 
model is that it can be used to determine the consequences of 
quenching/downsizing for the clustering of the SMG population. 
Furthermore, because we determine the submm flux densities of 
our galaxies self-consistently using a fitting function derived from 
radiative transfer calculations, there is not a monotonic mapping 
between SFR and submm flux density (a galaxy with a relatively 
modest SFR can still be submm-bright if it has sufficiently high 
dust mass). Thus, the results are specific to the SMG population 
rather than just the most rapidly star-forming galaxies (cf. Dave 
et al. 2010). Finally, our model explicitly accounts for the stochas- 
ticity that is inherent in the SMG selection because bright SMGs are 
an extreme population; thus, one may select a galaxy as an SMG 
because it is in a short-lived phase of elevated SFR (perhaps due 
to an interaction) or because it has an especially high submm flux 
density for its SFR and dust mass (because of e.g. an especially ex¬ 
tended geometry). GN20 could be a real-Universe example of the 
latter. Consequently, Poisson noise contributes to the scatter in the 
value of 5smg at a given ^mass and causes some of the most sig¬ 
nificant overdensities to contain few or no bright SMGs. Moreover, 
our model suggests that some the brightest SMGs in the Universe 
may lie in relatively isolated massive dark matter halos, consistent 
with observational findings (Chapman et al. 2015). 

A few other theoretical works have investigated the cluster¬ 
ing of the SMG population. Dave et al. (2010) studied the proper¬ 
ties of the most rapidly star-forming galaxies, which they consid¬ 
ered SMG analogues, in a cosmological hydrodynamical simula¬ 
tion. Because of the tight, monotonic SFR-stellar mass relation for 
star-forming galaxies in their simulation, they effectively selected 
the most-massive star-forming galaxies in their simulation. Conse¬ 
quently, they found that their simulated SMGs were highly clus¬ 
tered and biased, with a correlation length ro ~ lQh~^ comoving 
Mpc and bias of ~ 6. 

Cowley et al. (2015b) analyzed the clustering of SMGs in the 
current Durham semi-analytical model (Cowley et al. 2015a; Lacey 
et ah, in preparation). In this model, bright (Sgso > 4 mjy) SMGs 
at 2 ~ 2.5 exhibit a correlation length of ro = 5.5/i“^ Mpc and 
a bias of ~ 2.5. Both the correlation length and bias are almost 
independent of Sgso (at least for S'gso > 0.25 mJy), which is qual¬ 
itatively consistent with our results (i.e. galaxies in high overden¬ 
sities are not necessarily submm-bright). Interestingly, similar re¬ 
sults were obtained for a previous version of the model in which 
the physical nature of the SMGs was qualitatively very different 
(Almeida et al. 2011).^ However, the current model better matches 
the angular power spectrum of cosmic infrared background (GIB) 


^ In the current model, SMGs are predominantly starbursts driven by disk 
instabilities, and a mildly top-heavy initial mass function is used in star- 
bursts. In the previous model, SMGs were predominantly very gas-rich 
galaxies undergoing starbursts driven by minor mergers, and a flat IMF was 
assumed for starbursts. 


anisotropies Cowley et al. (2015b) than did the previous model, 
which suggests (perhaps unsurprisingly) that the CIB may provide 
more insight into the nature of SMGs than clustering does. 

Granato et al. (2014) performed dust radiative transfer on sim¬ 
ulated (proto-)clusters at 2 > 1 to determine whether they could 
reproduce the properties of the Planck sources of Clements et al. 
(2014). They found that their simulated (proto-)clusters had lower 
total SFRs than inferred from observations. This tension suggests 
that associations of dusty galaxies should be investigated further 
both in terms of theory and observation. 

It is worthwhile to clarify why we have claimed that SMGs 
are ‘poor tracers’ of high overdensities. In our model, the 5- 
SMG associations each lie in one of the most-massive cells at their 
respective redshifts. In this sense, SMG associations do trace high 
overdensities. However, such associations are uncommon (there are 
11 in our ~ 16—deg^ mock catalogue), which is consistent with the 
fact that only a few overdensities of 5 bright SMGs have been 
reported in the literature (Tamura et al. 2009; Chapman et al. 2009; 
Dannerbauer et al. 2009; Ma et al. 2015). Moreover, at 2 < 2.5, 
almost none of the highest overdensities contain > 3 SMGs despite 
a large fraction of the SMG population being located at 2 < 2.5. 
Thus, searching for high overdensities using SMG associations as 
beacons would result in a very incomplete sample at best. 

Overall, our results urge caution when interpreting SMG as¬ 
sociations in the context of large-scale structure. Because of their 
rarity, Poisson noise causes significant scatter in the SMG overden¬ 
sity at fixed dark matter overdensity (i.e. SMGs stochastically sam¬ 
ple the highest overdensities). Consequently, although the highest- 
redshift SMG associations trace some of the highest dark matter 
overdensities at those redshifts, most of the highest overdensities 
do not host SMG associations. At lower redshifts (2 < 2.5), the 
situation is worse: the highest overdensities tend to contain only a 
few SMGs at most, and the majority do not contain a single SMG. 
Thus, if one wishes to identify protoclusters, the complicated bias 
of SMGs makes them less-than-ideal beacons. 
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